Automatic Clustering of Utterances for a Dialogue Act Design

Authors

  • Ryuichiro Higashinaka
  • Noriaki Kawamae
  • Kugatsu Sadamitsu
  • Yasuhiro Minami
  • Toyomi Meguro
  • Kohji Dohsaka
  • Hirohito Inagaki
Abstract

Automatic clustering of utterances can be useful for modeling dialogue acts in dialogue applications. The Chinese restaurant process (CRP), a non-parametric Bayesian method, was previously introduced for clustering utterances in dialogue and showed promising results. This paper introduces the infinite HMM, which is also a non-parametric Bayesian method, and verifies its effectiveness. We also analyze our clustering results to discuss how to derive useful insights for a better dialogue act design.

Related resources

Automatic Discovery of Speech Act Categories in Educational Games

In this paper we address the important task of automated discovery of speech act categories in dialogue-based, multi-party educational games. Speech acts are important in dialogue-based educational systems because they help infer the student speaker’s intentions (the task of speech act classification) which in turn is crucial to providing adequate feedback and scaffolding. A key step in the spe...

Automatic Utterance Segmentation in Instant Messaging Dialogue

Instant Messaging (IM) chat sessions are real-time, text-based conversations which can be analyzed using dialogue-act models. Dialogue acts represent the semantic information of an utterance; however, messages must be segmented into utterances before classification can take place. We describe and compare two statistical methods for automatic utterance segmentation and dialogue-act classificatio...

A Quantitative View of Short Utterances in Daily Conversation: A Case Study of That’s right, That’s true and That’s correct

Short utterances serve a multitude of different communicative functions in interactive speech and have attracted due attention in recent research in dialogue acts. This paper presents a quantitative description of three short utterances, i.e., that’s right, that’s true, and that’s correct, and their variations, based on the Switchboard Dialogue Act Corpus. Particularly, it offers an overview to account...

Towards Speaker Adaptation for Dialogue Act Recognition

Dialogue act labels are used to represent the higher-level intention of utterances during human conversation (Stolcke et al., 2000). Automatic dialogue act recognition is still an active research topic. The conventional approach is to train one generic classifier using a large corpus of annotated utterances (Stolcke et al., 2000). One aspect that makes it so challenging is that people can e...

Dimensionality of dialogue act tagsets

This article compares one-dimensional and multi-dimensional dialogue act tagsets used for automatic labeling of utterances. The influence of tagset dimensionality on tagging accuracy is first discussed theoretically, then based on empirical data from human and automatic annotations of large-scale resources, using four existing tagsets: DAMSL, SWBD-DAMSL, ICSI-MRDA and MALTUS. The Dominant Funct...

Publication date: 2011